Performance analysis of massively parallel programs for graphics processing units

نویسندگان

چکیده

Any modern Graphics Processing Unit (graphics card) is a good platform to run massively parallel programs. Still, we lack tools observe and measure performance characteristics of GPU-based software. We state that due complex memory hierarchy thou- sands execution threads the all issues are about efficient use graphics card hierarchy. propose GPGPUSim simulator, previously used mostly for architecture validation, validation CUDA-based program. provide examples which show how simulation analysis

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Massively parallel chemical potential calculation on graphics processing units

Oneand two-stage free energy methods are common approaches for calculating the chemical potential from a molecular dynamics or Monte Carlo molecular simulation trajectory. Although these methods require significant amounts of CPU time spent on post-simulation analysis, this analysis step is wellsuited for parallel execution. In this work, we implement this analysis step on graphics processing u...

متن کامل

Parallel Genetic Programming on Graphics Processing Units

In program inference, the evaluation of how well a candidate solution solves a certain task is usually a computationally intensive procedure. Most of the time, the evaluation involves either submitting the program to a simulation process or testing its behavior on many input arguments; both situations may turn out to be very time-consuming. Things get worse when the optimization algorithm needs...

متن کامل

Rigid body constraints realized in massively-parallel molecular dynamics on graphics processing units

a r t i c l e i n f o a b s t r a c t Molecular dynamics (MD) methods compute the trajectory of a system of point particles in response to a potential function by numerically integrating Newton's equations of motion. Extending these basic methods with rigid body constraints enables composite particles with complex shapes such as anisotropic nanoparticles, grains, molecules, and rigid proteins t...

متن کامل

Algorithmic performance studies on graphics processing units

We report on our experience with integrating and using graphics processing units (GPUs) as fast parallel floatingpoint co-processors to accelerate two fundamental computational scientific kernels on the GPU: sparse direct factorization and nonlinear interior-point optimization. Since a full re-implementation of these complex kernels is typically not feasible, we identify the matrix-matrix multi...

متن کامل

Strategies for Parallel Ant Colony Optimization on Graphics Processing Units

Ant colony algorithms are known to have a significant ability of finding high-quality solutions in a reasonable time [2]. However, the computational time of these methods is seriously compromised when the current instance of the problem has a high dimension and/or is hard to solve. In this line, a significant amount of research has been done in order to reduce computation time and improve the s...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Problemy programmirovaniâ

سال: 2022

ISSN: ['1727-4907']

DOI: https://doi.org/10.15407/pp2022.03-04.051